Automatically Identifying the Source Words of Lexical Blends in English
نویسندگان
چکیده
Newly coined words pose problems for natural language processing systems because they are not in a system’s lexicon, and therefore no lexical information is available for such words. A common way to form new words is lexical blending, as in cosmeceutical, a blend of cosmetic and pharmaceutical. We propose a statistical model for inferring a blend’s source words drawing on observed linguistic properties of blends; these properties are largely based on the recognizability of the source words in a blend. We annotate a set of 1,186 recently coined expressions which includes 515 blends, and evaluate our methods on a 324-item subset. In this first study of novel blends we achieve an accuracy of 40% on the task of inferring a blend’s source words, which corresponds to a reduction in error rate of 39% over an informed baseline. We also give preliminary results showing that our features for source word identification can be used to distinguish blends from other kinds of novel words.
منابع مشابه
Using social media to find English lexical blends 1
We present a method for identifying English lexical blends — words such as complisult (compliment + insult) and globesity (global + obesity) — from social media, specifically Twitter. Our method is based on observations about words and phrases that are commonly used to introduce new words and corpus patterns that are often used to describe the meaning of lexical blends, and leverages the massiv...
متن کاملEmergent Faithfulness to Proper Nouns in Novel English Blends
Emergent effects (McCarthy & Prince 1994) are the result of phonological constraints or rankings that only reveal themselves in a specific context. That is, they have no discernable effect in the regular phonology of a language but become apparent when speakers perform particular tasks. Crucially, they reveal knowledge that was not learned directly from ambient language data. Emergent effects h...
متن کاملThe Effect of Raising Morphological Decomposition Awareness on Lexical Knowledge of Complex English Words
Lexical knowledge of complex English words is an important part of language skills and crucial for fluent language use. This study aimed to assess the role of morphological decomposition awareness as a vocabulary learning strategy on learners’ productive and receptive recall and recognition of complex English words. University students majoring English at the...
متن کاملHead Faithfulness in Lexical Blends: a Positional Approach to Blend Formation
KATHERINE SHAW: Head Faithfulness in Lexical Blends: A Positional Approach to Blend Formation (Under the direction of Elliott Moreton) This thesis applies Positional Faithfulness theory (Beckman 1998) to the problem of lexical blending in English. Lexical blends, like brunch or motel, contract multiple source words into a single lexical item shaped by competing sets of phonological and psycholi...
متن کاملFirst Language Activation during Second Language Lexical Processing in a Sentential Context
Lexicalization-patterns, the way words are mapped onto concepts, differ from one language to another. This study investigated the influence of first language (L1) lexicalization patterns on the processing of second language (L2) words in sentential contexts by both less proficient and more proficient Persian learners of English. The focus was on cases where two different senses of a polys...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Linguistics
دوره 36 شماره
صفحات -
تاریخ انتشار 2010